AITopics | historical record

Collaborating Authors

historical record

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

The Download: how the World Cup ball will fly and OpenAI's "super app"

MIT Technology ReviewJun-8-2026, 12:10:00 GMT

The Download: how the World Cup ball will fly and OpenAI's "super app" Plus: OpenAI plans to turn ChatGPT into a'super app' before its IPO. Why this year's World Cup ball may not fly as far Much is new about this month's FIFA World Cup tournament. It hosts more teams than ever before. It's the first to occur in three different host countries. And, like every World Cup for over half a century, it will employ a football with a brand-new design. Through wind-tunnel experiments, researchers found that long-distance kicks with Adidas's new Trionda ball might not travel as far as they did in the past.

large language model, machine learning, natural language, (17 more...)

MIT Technology Review

Country: North America > United States (0.50)

Industry:

Government (0.77)
Leisure & Entertainment > Sports > Soccer (0.55)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.85)

Add feedback

b6b4906c1334656e97cc9968ccfca073-Paper-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 16:32:09 GMT

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
(2 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)
Leisure & Entertainment (0.92)
Media > Film (0.67)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(4 more...)

Add feedback

HYDRA: Model Factorization Framework for Black-Box LLM Personalization Y uchen Zhuang

Neural Information Processing SystemsOct-10-2025, 14:13:17 GMT

Personalization has emerged as a critical research area in modern intelligent systems, focusing on mining users' behavioral history and adapting to their preferences

history, language model, personalization, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
(2 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)
Leisure & Entertainment (0.92)
Media > Film (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
(2 more...)

Add feedback

Oldest known dog breed reveals hidden human history

Breakthroughs, discoveries, and DIY tips sent every weekday. The Iditarod is the longest annual sled dog race– covering over 1,500 miles across Alaska. A close look into canine genetics reveals sled dogs have been around and on the move for thousands of years. Specifically, the Greenland sled dog–called Qimmeq (singular), or Qimmit (plural) in Greenlandic–has a history traceable all the way back 9,500 years to Zhokhov Island in Eastern Siberia. And they've been a distinct, isolated group for about 1,000 years of that time.

greenland, history, sled dog, (16 more...)

Popular Science

Country:

North America > Greenland (0.69)
North America > United States > Alaska (0.26)

Genre: Research Report > New Finding (0.68)

Industry: Leisure & Entertainment > Sports (1.00)

Technology: Information Technology > Artificial Intelligence (0.35)

Add feedback

Open-Set Living Need Prediction with Large Language Models

Lan, Xiaochong, Feng, Jie, Sun, Yizhou, Gao, Chen, Lei, Jiahuan, Shi, Xinlei, Luo, Hengliang, Li, Yong

arXiv.org Artificial IntelligenceJun-4-2025

Living needs are the needs people generate in their daily lives for survival and well-being. On life service platforms like Meituan, user purchases are driven by living needs, making accurate living need predictions crucial for personalized service recommendations. Traditional approaches treat this prediction as a closed-set classification problem, severely limiting their ability to capture the diversity and complexity of living needs. In this work, we redefine living need prediction as an open-set classification problem and propose PIGEON, a novel system leveraging large language models (LLMs) for unrestricted need prediction. PIGEON first employs a behavior-aware record retriever to help LLMs understand user preferences, then incorporates Maslow's hierarchy of needs to align predictions with human living needs. For evaluation and application, we design a recall module based on a fine-tuned text embedding model that links flexible need descriptions to appropriate life services. Extensive experiments on real-world datasets demonstrate that PIGEON significantly outperforms closed-set approaches on need-based life service recall by an average of 19.37%. Human evaluation validates the reasonableness and specificity of our predictions. Additionally, we employ instruction tuning to enable smaller LLMs to achieve competitive performance, supporting practical deployment.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2506.02713

Country:

Asia > China > Shanghai > Shanghai (0.05)
Asia > China > Beijing > Beijing (0.05)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)

Genre:

Research Report > New Finding (0.93)
Research Report > Experimental Study (0.68)

Industry:

Information Technology (0.68)
Consumer Products & Services (0.68)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

MMXU: A Multi-Modal and Multi-X-ray Understanding Dataset for Disease Progression

Mu, Linjie, Huang, Zhongzhen, Qin, Shengqian, Zhu, Yakun, Zhang, Shaoting, Zhang, Xiaofan

arXiv.org Artificial IntelligenceFeb-17-2025

Large vision-language models (LVLMs) have shown great promise in medical applications, particularly in visual question answering (MedVQA) and diagnosis from medical images. However, existing datasets and models often fail to consider critical aspects of medical diagnostics, such as the integration of historical records and the analysis of disease progression over time. In this paper, we introduce MMXU (Multimodal and MultiX-ray Understanding), a novel dataset for MedVQA that focuses on identifying changes in specific regions between two patient visits. Unlike previous datasets that primarily address single-image questions, MMXU enables multi-image questions, incorporating both current and historical patient data. We demonstrate the limitations of current LVLMs in identifying disease progression on MMXU-\textit{test}, even those that perform well on traditional benchmarks. To address this, we propose a MedRecord-Augmented Generation (MAG) approach, incorporating both global and regional historical records. Our experiments show that integrating historical records significantly enhances diagnostic accuracy by at least 20\%, bridging the gap between current LVLMs and human expert performance. Additionally, we fine-tune models with MAG on MMXU-\textit{dev}, which demonstrates notable improvements. We hope this work could illuminate the avenue of advancing the use of LVLMs in medical diagnostics by emphasizing the importance of historical context in interpreting medical images. Our dataset is released at \href{https://github.com/linjiemu/MMXU}{https://github.com/linjiemu/MMXU}.

information, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2502.11651

Country:

Asia > Middle East > UAE (0.28)
North America > Mexico (0.28)

Genre: Research Report (1.00)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

Add feedback

Early evidence of how LLMs outperform traditional systems on OCR/HTR tasks for historical records

Kim, Seorin, Baudru, Julien, Ryckbosch, Wouter, Bersini, Hugues, Ginis, Vincent

arXiv.org Artificial IntelligenceJan-20-2025

We explore the ability of two LLMs -- GPT-4o and Claude Sonnet 3.5 -- to transcribe historical handwritten documents in a tabular format and compare their performance to traditional OCR/HTR systems: EasyOCR, Keras, Pytesseract, and TrOCR. Considering the tabular form of the data, two types of experiments are executed: one where the images are split line by line and the other where the entire scan is used as input. Based on CER and BLEU, we demonstrate that LLMs outperform the conventional OCR/HTR methods. Moreover, we also compare the evaluated CER and BLEU scores to human evaluations to better judge the outputs of whole-scan experiments and understand influential factors for CER and BLEU. Combining judgments from all the evaluation metrics, we conclude that two-shot GPT-4o for line-by-line images and two-shot Claude Sonnet 3.5 for whole-scan images yield the transcriptions of the historical records most similar to the ground truth.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2501.11623

Country:

Europe > Belgium > Brussels-Capital Region > Brussels (0.05)
North America > United States > Nevada > Clark County > Las Vegas (0.04)
Europe > Middle East > Malta (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

'Hold on to your seats': how much will AI affect the art of film-making?

The GuardianJul-27-2024, 09:07:20 GMT

Last year, Rachel Antell, an archival producer for documentary films, started noticing AI-generated images mixed in with authentic photos. There are always holes or limitations in an archive; in one case, film-makers got around a shortage of images for a barely photographed 19th-century woman by using AI to generate what looked like old photos. Which brought up the question: should they? And if they did, what sort of transparency is required? The capability and availability of generative AI – the type that can produce text, images and video – have changed so rapidly, and the conversations around it have been so fraught, that film-makers' ability to use it far outpaces any consensus on how.

film-maker, geduldick, generative ai, (14 more...)

The Guardian

Country:

Europe > France (0.06)
North America > United States > New York (0.05)
Europe > Russia > North Caucasian Federal District > Chechen Republic (0.05)
North America > United States > California > Los Angeles County > Los Angeles (0.04)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Generation (0.43)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.43)

Add feedback

Can Foundational Large Language Models Assist with Conducting Pharmaceuticals Manufacturing Investigations?

Salami, Hossein, Smith-Goettler, Brandye, Yadav, Vijay

arXiv.org Artificial IntelligenceApr-23-2024

General purpose Large Language Models (LLM) such as the Generative Pretrained Transformer (GPT) and Large Language Model Meta AI (LLaMA) have attracted much attention in recent years. There is strong evidence that these models can perform remarkably well in various natural language processing tasks. However, how to leverage them to approach domain-specific use cases and drive value remains an open question. In this work, we focus on a specific use case, pharmaceutical manufacturing investigations, and propose that leveraging historical records of manufacturing incidents and deviations in an organization can be beneficial for addressing and closing new cases, or de-risking new manufacturing campaigns. Using a small but diverse dataset of real manufacturing deviations selected from different product lines, we evaluate and quantify the power of three general purpose LLMs (GPT-3.5, GPT-4, and Claude-2) in performing tasks related to the above goal. In particular, (1) the ability of LLMs in automating the process of extracting specific information such as root cause of a case from unstructured data, as well as (2) the possibility of identifying similar or related deviations by performing semantic search on the database of historical records are examined. While our results point to the high accuracy of GPT-4 and Claude-2 in the information extraction task, we discuss cases of complex interplay between the apparent reasoning and hallucination behavior of LLMs as a risk factor. Furthermore, we show that semantic search on vector embedding of deviation descriptions can be used to identify similar records, such as those with a similar type of defect, with a high level of accuracy. We discuss further improvements to enhance the accuracy of similar record identification.

deviation, incident, language model, (15 more...)

arXiv.org Artificial Intelligence

2404.15578

Country: North America > United States > New Jersey > Union County > Rahway (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

The Download: tracing a mysterious covid strain, and fighting dengue with drones

MIT Technology ReviewMar-22-2024, 13:10:00 GMT

Historians have started using machine learning to examine historical documents, including astronomical tables like those produced in Venice and other early modern cities. Proponents claim that the application of modern computer science to the past helps draw connections across a broader swath of the historical record than would otherwise be possible, correcting distortions that come from analyzing history one document at a time. But it introduces distortions of its own, including the risk that machine learning will slip bias or outright falsifications into the historical record. The way sea sponges pump water is really quite amazing. I never thought I'd be transfixed by a bed making competition, but here we are.

download, historical record, mysterious covid strain, (3 more...)

MIT Technology Review

Country: Oceania > Australia (0.10)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.40)
Health & Medicine > Therapeutic Area > Immunology (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.56)

Add feedback